CDS

Accession Number TCMCG052C07821
gbkey CDS
Protein Id CAB4284117.1
Location complement(join(19209307..19209429,19209939..19210046,19210137..19210380,19210611..19210846,19211042..19211232,19211383..19211620,19213682..19213900))
Organism Prunus armeniaca
locus_tag CURHAP_LOCUS39558

Protein

Length 452aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJEB37669, BioSample:SAMEA6812185
db_source embl accession CAEKDK010000006.1
Definition unnamed protein product [Prunus armeniaca]
Locus_tag CURHAP_LOCUS39558

EGGNOG-MAPPER Annotation

COG_category C
Description formamidase C869.04 isoform X1
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00524        [VIEW IN KEGG]
KEGG_rclass RC02432        [VIEW IN KEGG]
RC02810        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01455        [VIEW IN KEGG]
EC 3.5.1.49        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00460        [VIEW IN KEGG]
ko00630        [VIEW IN KEGG]
ko00910        [VIEW IN KEGG]
ko01200        [VIEW IN KEGG]
map00460        [VIEW IN KEGG]
map00630        [VIEW IN KEGG]
map00910        [VIEW IN KEGG]
map01200        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCATCTTATGGACCAAGATTGGTTGTGCCTATAGATGTGAAGAAGAAACCATGGGAACAGAAGCTTCCACTTCACAACCGATGGCACCCGGACATACCACCGGTGGCAGAAGTTACAGTTGGAGATGTTTTTCGGGTCGAAATGGTGGATTTTAGTGGAGGTGGTATTACCAAAGAATACACAGCAGAGGACATCAAACATGCCAATCCCTCCATTGTTCATTATCTTAGTGGGCCAATTAGAGTTTTGGACAAGGATGGCACTCCAGCAAAGCCAGGTGATCTTCTGGCAGTTGAGATATGCAACCTGGGTCCTCTCCCAGGAGATGAATGGGGTTTCACAGCAACATTTGACAGAGAAAATGGAGGGGGTTTTCTCACTGACCATTTTCCTTGTGCAACCAAAGCTATTTGGTACTTTGAAGGGATATATGCGTACTCACCTCAAATACCAGGAGTGAGATTCCCAGGTTTAACCCACCCCGGAATAGTTGGAACAGCACCATCAATGGAACTCCTGAATATATGGAATGAAAGGGAGAGAGAACTTGAAGAAAATGGACTCAACTCTATGAAACTATGTGAGGTTTTGCATCAACGACCATTGGCTAACCTACCATCAACAAAAGGTTGCGTCCTCGGAGGGATCAAGGAGGGCACTCCTGAATGGGAAAAGATAGCCTTGGAGGCTGCAAGAACAATTCCAGGAAGAGAAAATGGTGGCAATTGTGACATTAAAAATCTTAGTAGTGGTTCAAAGATATACCTTCCAGTATTCATTGAAGGAGCAAATCTTAGCACTGGTGACATGCACTTTTCCCAGGGCGATGGTGAAATTTCATTCTGTGGAGCAATTGAGATGAGTGGTTTCCTGGAGCTCAAGTGTGAAATTATAAGGGATGGAATGAAAGAATACCTTACACCAATGGGGCCCACTCCTCTTCATGTGAACCCAATCTTTGAGATAGGCCCTGTTGAGCCAAGATTTTCAGAATGGTTGGTGTTTGAGGGCATCAGTGTTGATGAGAGTGGGAAGCAGCATTACCTAGATGCAACTGTTGCATATAAGCGTGCAGTACTGAATGCTATTGACTACCTCTCTAAATTTGGATACTCCAAAGAGCAGAGCTACCTTCTGTTGTCATGCTGCCCATGTGAGGGAAGAATTTCTGGAATAGTGGACTCTCCCAATGCCATGGCAACCTTAGCAATCCCAACAGCTATCTTTGACCAGGATATACGTCCAAGAGCAAACAAGGTGCCAGTTGGGCCTCGGATAGTGAGGAAACCAGATGTCCTGAAATGTACTTATGATGGAAATTTGGCAACTACAAGGAACCTTAGCTCAGCAACATAA
Protein:  
MASYGPRLVVPIDVKKKPWEQKLPLHNRWHPDIPPVAEVTVGDVFRVEMVDFSGGGITKEYTAEDIKHANPSIVHYLSGPIRVLDKDGTPAKPGDLLAVEICNLGPLPGDEWGFTATFDRENGGGFLTDHFPCATKAIWYFEGIYAYSPQIPGVRFPGLTHPGIVGTAPSMELLNIWNERERELEENGLNSMKLCEVLHQRPLANLPSTKGCVLGGIKEGTPEWEKIALEAARTIPGRENGGNCDIKNLSSGSKIYLPVFIEGANLSTGDMHFSQGDGEISFCGAIEMSGFLELKCEIIRDGMKEYLTPMGPTPLHVNPIFEIGPVEPRFSEWLVFEGISVDESGKQHYLDATVAYKRAVLNAIDYLSKFGYSKEQSYLLLSCCPCEGRISGIVDSPNAMATLAIPTAIFDQDIRPRANKVPVGPRIVRKPDVLKCTYDGNLATTRNLSSAT